Reinforcement Learning for Biped Robot

Authors

Yutaka Nakamura

Masa-aki Sato

Shin Ishii

Abstract

Animal rhythmic movements such as locomotion are thought to be controlled by neural circuits called central pattern generators (CPGs), which generate oscillatory signals. Motivated by this biological mechanism, rhythmic movements controlled by CPGs have been studied. As an autonomous learning framework for a CPG controller, we propose a reinforcement learning method called the CPG-actor-critic method. We apply this method to a biped robot. Computer simulations show that our method is able to train the CPG such that the biped robot walks stably. We also examine the characteristics of this CPG controller.
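To illustrate what "a neural circuit that generates oscillatory signals" can look like, here is a minimal sketch of a two-neuron oscillator with mutual inhibition and self-adaptation (a Matsuoka-style model, which is a common choice for CPGs). It is an assumption for illustration only: the abstract does not state which oscillator model the authors use, and the function name `simulate_cpg`, the parameter values, and the Euler integration are all hypothetical.

```python
def simulate_cpg(steps=4000, dt=0.005, tau=0.25, tau_adapt=0.5,
                 beta=2.5, w=2.5, c=1.0):
    """Integrate a two-neuron mutual-inhibition oscillator with Euler steps.

    All parameter values are illustrative assumptions, not taken from the paper.
    Returns the list of output samples y0 - y1, one per time step.
    """
    u = [0.1, 0.0]  # membrane states; slightly asymmetric to break symmetry
    v = [0.0, 0.0]  # adaptation (fatigue) states
    out = []
    for _ in range(steps):
        # Firing rates are the rectified membrane states.
        y = [max(0.0, u[0]), max(0.0, u[1])]
        # Each neuron is driven by the tonic input c, inhibited by the
        # other neuron (weight w) and by its own adaptation (weight beta).
        du = [(-u[i] - beta * v[i] - w * y[1 - i] + c) / tau for i in range(2)]
        dv = [(-v[i] + y[i]) / tau_adapt for i in range(2)]
        for i in range(2):
            u[i] += dt * du[i]
            v[i] += dt * dv[i]
        # The difference of the two firing rates alternates in sign and could
        # serve as a rhythmic command, e.g. flexor versus extensor drive.
        out.append(y[0] - y[1])
    return out


if __name__ == "__main__":
    signal = simulate_cpg()
    print(min(signal), max(signal))
```

In a learning setting such as the one the abstract describes, quantities like the connection weights or the input to the oscillator would be the natural things to adapt; how the CPG-actor-critic method does this is not specified in the abstract.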

Similar resources

Episodic Reinforcement Learning Control Approach for Biped Walking

This paper presents a hybrid dynamic control approach to the realisation of humanoid biped robotic walk, focusing on the policy gradient episodic reinforcement learning with fuzzy evaluative feedback. The proposed structure of controller involves two feedback loops: a conventional computed torque controller and an episodic reinforcement learning controller. The reinforcement learning part inclu...

Dynamic Control Algorithm for Biped Walking Based on Policy Gradient Fuzzy Reinforcement Learning

This paper presents a novel dynamic control approach to acquire biped walking of humanoid robots, focused on policy gradient reinforcement learning with fuzzy evaluative feedback. The proposed structure of the controller involves two feedback loops: a conventional computed torque controller including an impact-force controller, and a reinforcement learning computed torque controller. Reinforcement learnin...

Poincaré-Map-Based Reinforcement Learning For Biped Walking

We propose a model-based reinforcement learning algorithm for biped walking in which the robot learns to appropriately modulate an observed walking pattern. Via-points are detected from the observed walking trajectories using the minimum jerk criterion. The learning algorithm modulates the via-points as control actions to improve walking trajectories. This decision is based on a learned model of...

Biped Balance Control by Reinforcement Learning

This work studied biped walking with single (one-leg) support and balance control using reinforcement learning. The proposed Q-learning algorithm makes a robot learn to walk without any prior knowledge of a dynamics model. This balance control with single support shifts the Zero Moment Point (ZMP) of the robot to a stable region over walking sequences by means of learned gestures. Hence, the p...

Path Planning for a Statically Stable Biped Robot Using PRM and Reinforcement Learning

In this paper, path planning and obstacle avoidance for a statically stable biped robot using PRM and reinforcement learning are discussed. The main objective of the paper is to compare these two methods of path planning for applications involving a biped robot. The statically stable biped robot under consideration is a 4-degree-of-freedom walking robot that can follow any given trajectory on fla...

Publication date: 2003
